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against  backgrounds  of  another  color.  1 

Two  studies  totalling  32064  practice  and  experimental  trials  were  conducted. 
The  first  study  determined  that  the  CIE  Uniform  Color  Spaces  are  inappropriate 
for  the  modelling  of  RS.  Subsequently,  a  different  3-space  geometry  and 
colorimetric  component  scaling  were  empirically  derived  from  the  Study  1  data 
to  produce  a  one -dimensional  AE  scale  which  approximates  an  interval  scale  of 
RS.  This  AE  scale  and  others  were  then  applied  to  the  different  stimulus 
conditions  ^in  Study  2  to  determine  the  generalizability  of  such  AE  metrics. 

The  pajir  of  studies  is  conclusive:  several  AE  scales  exist  which  serve 
equally  wejll  to  describe  or  prescribe  RS  with  multicolor  CRT  raster  imagery 
for  a  range  of  character  luminances  in  both  positive  and  negative  presentation 
polarities.  These  are  the  Y,u' ,v' ,  logY,u',v',  L*,  u' ,v' ,  and  L*,u*,v*  rescalejd 
color  spaces.  Because  of  its  predictive  accuracy  and  simplicity,  a  luminance- 
generalized,  AE-standardized  Y,u* ,v*  metric,  accounting  for  71%  and  75%  of  the 
RS  variability  in  Studies  1  and  2,  respectively,  is  recommended  as  the  most 
appropriate  metric  of  emissive  display  legibility  to  be  tested  in  these  studiesj. 
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INTRODUCTION 


Overview 


The  recent  proliferation  of  single-  and  multi-color 
electronic  displays  and  display  technologies  is  accompanied 
by  a  growing  need  to  understand  how  best  to  employ  them. 
Galves  and  Brun  (1982)  indicated  the  following  potential 
advantages  for  multichrome  over  monochrome  technologies  in 
luminous  data  displays:  increase  in  displayable  information 
density,  greatly  reduced  data  acquisition  time,  greatly 
reduced  risk  of  error  in  symbol  and  number  identification, 
and  the  possibility  of  color-coding,  supplementing 
information  without  requiring  a  shape-coding  system. 


Color  producibility  is  restricted  by  the  limitations  of 
specific  display  technologies.  Also,  available  colors  are 
often  displayed  with  primary  concern  for  the  subjective 
appreciation  of  display  users,  but  not  for  objective,  task- 
related  visual  performance  measures.  Although  the  improper 
or  sub-optimal  usage  of  display  colors  may  be  of  little 
consequence  in  many  applications,  there  exist  military, 
industrial,  and  educational  settings  in  which  the  considered 
usage  of  simultaneously  displayed  colors,  generally  in  the 
form  of  alphanumerics  and  background  fields,  may  be  a 
deciding  factor  in  the  comfortable,  timely,  or  even 


successful  completion  of  a  mission  or  process  the  display  is 
intended  to  support. 

One  critical  example  is  the  mission  requirements  of  an 
aircraft  equipped  with  a  head-up  display  (HUD).  Its 
electronic  display-generated  flight  control  or  weapons 
delivery  information  is  made  available  to  the  pilot  in  the 
form  of  visual  symbols  optically  combined  with  the  pilot's 
normal  field  of  view  through  the  windscreen.  The  design  of 
a  HUD  is  tested  through  many  hours  of  in-flight  operation. 
It  is  successful  to  the  extent  that  a  pilot  can  quickly  and 
accurately  extract  (read)  luminous  information  displayed 
against  the  real-world  background--a  dynamic,  spatially 
complex  visual  field  varying  greatly  in  luminance  and 
chrominance.  For  this  and  similar  applications,  a  means  to 
predict  the  performance  effects  of  displayed  color  contrast 
would  aid  designers  of  multichrome  display  systems. 

A  study  by  Lippert  (1984),  which  will  be  denoted  "Study 
1"  throughout  this  report,  focused  on  reading  speed  (RS)  and 
accuracy  in  tasks  requiring  strings  of  dot-matrix  numerals 
displayed  against  spatially  uniform  backgrounds  to  be  read 
from  a  static,  full-color  simulated  cathode-ray  tube  (CRT) 
HUD.  RS  is  the  reciprocal  of  the  time  required  to 
accurately  read  a  random  numeral  string  (98%  correct 
responses)  and  is  therefore  a  proportional  measure  of 
improvement  in,  or  ease  of,  legibility.  The  numeral  reading 
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task  is  free  of  the  language  redundancy  confounds  associated 
with  word  reading  tasks.  Instead  of  the  pilot's  view 
through  the  windscreen,  the  HUD  was  programmed  into  a 
computer-generated  real-world  representation  composed  of  10 
geometric  fields  corresponding  to  sky,  clouds,  and  terrain, 
implying  an  horizon.  Each  of  the  independently  addressable 
background  fields  could  be  assigned  any  color  attainable 
with  the  selected  color  monitor.  Therefore,  the  operational 
definition  of  color  contrast  for  that  study  is  the 
combination  of  luminance  and  chrominance  difference  between 
a  given  reading  task  numeral  string,  or  target  (T),  and  its 
immediate  background  field  (B)  which  distinguishes  one  from 
the  other  both  in  terms  of  colorimetric  parameters  and  RS. 
The  investigation  was  undertaken  to  describe  better  the 
stimulus-response  relationships  of  these  two  variables. 

Study  2,  described  in  detail  in  this  report,  is  an 
investigation  of  the  generalizability  of  displayed  data 
legibility  predictors,  empirically  derived  from  RS  in  Study 
1,  to  different  stimulus  levels.  The  laboratory  apparatus 
and  experimental  method  are  the  same  as  for  Study  1  and 
Study  2.  In  Study  2,  reading  speed  as  a  function  of  color 
contrast  was  determined  for  achromatic,  yellow-green,  and 
red  random  numeral  strings  3,  4,  and  5  digits  in  length  as 
in  Study  1,  but  ,Tiewed  at  0.76  m  as  opposed  to  0.5  m  in 
Study  1.  The  experiment  also  incorporated  both  positive 
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(e.g.,  lighter  targets  on  a  darker  background)  and  negative 
(e.g.,  darker  targets  on  a  lighter  background)  polarities  as 
opposed  to  Study  l's  limitation  to  positive  polarities  only. 
Negative  polarity  was  included  to  answer  questions  about 
color  contrast  formatting  in  head-down  displays  and  other 
daylight  displays  not  constrained  to  the  HUD's  positive 
polarity . 


Color  Science 

Because  the  prediction  of  visual  task  performance  from 
quantified  color  displays  depends  on  color  specifications, 
the  following  discussion  is  pertinent.  The  subject  of  color 
incorporates  both  the  physical  parameters  of  colors  and  the 
visual  experience  or  sensations  they  produce.  As  a  broad 
psychophysiological  concept,  color  includes  not  only  the 
sensation  of  hues,  but  also  grays  which,  as  characteristics 
of  light,  may  be  described  objectively  in  terms  of 
photometric  quantity,  dominant  wavelength,  and  excitation 
purity.  These  three  dimensions  roughly  correspond  to  the 
subjective  visual  attributes  of  brightness,  hue,  and 
saturation,  respectively. 

The  most  practical  standard  white  light  has  traditionally 
been  direct  noon  sunlight,  although  atmospheric  pollution 
has  necessitated  its  replacement  with  Standard  Illuminants  C 
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or  D.  White  light  is  termed  achromatic  since  it  contains 
all  visible  wavelengths  in  proportions  of  equally  perceived 
intensity. 

Most  colors  are  chromatic,  however,  in  that  they  exhibit 
hue,  which  is  directly  related  to  the  differences  between 
their  spectral  energy  distributions  and  that  of  sunlight  (or 
an  established  substitute).  The  result  of  these  differences 
is  the  sensation  of  a  "greenish,"  "reddish,"  etc.,  hue. 
Color  scientists  have  determined  that,  according  to  the 
average  observer's  judgment,  the  common  names  of  pure 
spectral  hues  should  be  applied  to  the  wavelength  ranges  in 
Table  1. 

Color  additivity.  The  Young-Helmholtz,  or  trichromacy, 
theory  has  been  one  relatively  successful  means  of 
explaining  many  color  vision  phenomena.  Developed  at  about 
the  turn  of  the  19th  century,  it  holds  that  the  human  eye  is 
capable  of  three  separate  color  sensations,  corresponding  to 
retinal  stimulation  by  red,  green,  and  blue  wavelengths. 
These  sensations  combine  so  that  every  color  sensation  is 
the  effect  of  the  joint  stimulation  of  three  discrete 
retinal  elements  in  some  definite  proportion.  Red,  green, 
and  blue  are  thus  known  as  the  additive  primary  colors. 

Many  refinements  have  been  made  to  this  concept,  which 
actually  originated  with  Newton,  who  mechanically  modeled 
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Table  1.  Common  Names  of  Pure  Spectral  Hues 
(Wyszecki  and  Stiles,  1967). 


COLOR 

Wavelength,  nm 

Violet 

390-455  (complement) 

Blue 

455-492 

Green 

492-577 

Yellow 

577-597 

Orange 

597-622 

Red 

622-770 

the  primaries  as  weights  placed  at  the  vertices  of  a 
triangular  plate.  The  resultant  color  perception  was 
analogous  to  the  center  of  gravity  of  the  system,  though 
Newton  believed  yellow  was  a  primary  color.  Figures  1,  2, 
and  3  represent  a  progression  of  concepts  to  the  present 
working  model  of  color  additivity. 


Maxwell  conceived  the  color  triangle,  clarifying  the 
concept  of  color  additivity  with  an  equilateral  triangle  in 
which  the  altitude  from  each  primary  to  the  opposite  side 
constitutes  a  value  of  100.  The  proportionate  lengths  of 
the  altitudes  intersecting  at  any  color  within  the  triangle 
were  the  proportions  of  the  respective  primaries  to  be  mixed 
to  duplicate  the  color.  For  color  H  in  Figure  1,  the 
distances  r,  g,  and  b  represent  the  proportions  of  the 
primaries  R,  G,  and  B  which,  when  added  together,  will 
imitate  H.  Removing  a  primary  color's  spectral  energy  from 
full-spectrum  white  light  yields  the  subtractive  complement 
shown  opposite  it  (i.e.,  yellow  results  from  the  subtraction 
of  blue  from  white). 


Grassman  (1853)  formalized  the  laws  of  color  mixture  in 
three-dimensional  tristimulus  space  by  intersecting  the 
color  triangle  at  points  of  equal  perceptual  intensity  along 
the  primaries'  intensity  vectors,  R,  G,  and  B,  creating  the 
unit  (intensity)  plane,  or  chromaticity  diagram  (Figure  2). 
Tristimulus  space  is  useful  because  most  colors  may  be 


specified  in  terms  of  the  simple  additive  combination  of 
quantities  of  each  primary  constituting  them  (e.g.,  the 
tristimulus  values  of  a  color  of  interest) .  The  added 
tristimulus  values  of  two  colors  specify  a  third  color  which 
is  a  perceptual  match  to  the  mixture  of  the  first  two.  This 
innovation  made  the  quality  control  of  mixing,  matching,  and 
maintaining  the  colors  of  pigmented  or  reflective  sources 
more  precise,  easily  communicated,  and  even  routine. 

However,  color-matching  experiments  (see  Graham,  1965,  p. 
370),  using  the  red,  green,  and  blue  primaries  (at 
wavelengths  700,  546.1,  and  435.8  nm:  the  "RGB  System"), 
consistently  showed  that  one  of  the  reference  (primary) 
stimuli  had  to  be  added  to  the  test  (to-be-matched) 
stimulus,  effectively  desaturating  it,  in  order  for  a  match 
to  be  made  of  some  test  colors.  In  other  words,  there  were 
some  (pure)  colors  which  could  not  be  matched  through  the 
additive  mixture  of  the  primaries,  and  this  left  the  science 
of  color  additivity  incomplete. 

A  solution  was  found  by  the  Commission  Internationale  de 
l'Eclairage  (CIE)  in  1931  by  specifying  imaginary  primaries 
such  that  all  real  colors  could  be  achieved  through  their 
additive  combination.  This  system  uses  the  labels  X,  Y,  and 
Z  for  the  primary  vectors,  creating  the  chromaticity  diagram 
seen  in  Figure  3,  which  describes  normal  human  trichromatic 
additivity  relative  to  these  mathematical  primaries.  The 
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Figure  3. 


1931  CIE  tristimulus  space. 


Standard  Observer  for  Colorimetry  (1931  2-degree  Standard 
Observer)  is  defined  by  this  convention,  derived  from  data 
from  two  color  matching  studies  (Guild,  1931;  Wright, 
1928-29),  which  are  considered  the  best-controlled  to  date 
and  which  provide  the  current  colorimetric  standard. 

All  pure  spectral  colors  lie  on  the  spectrum  locus, 
except  for  the  purples  across  the  bottom  of  the  space,  which 
are  all  additive  combinations  of  blues  and  reds  and  which 
may  be  specified  in  complementary  wavelengths--the  negative 
of  the  spectrum  locus  wavelength  colinear  with  the  purple 
stimulus  chromaticity  coordinates  and  E,  or  Standard 
Illuminant  C,  the  neutral  reference  color.  The  ratios  of 
each  of  the  tristimulus  values,  X,  Y,  Z,  to  their  sum,  (X  + 
Y  +  Z),  are  the  chromaticity  coordinates,  x,  y,  z,  which 
thus  add  to  unity  for  every  color.  They  specify  a  color's 
dominant  wavelength  and  excitation  purity  (e.g.,  the 
proportionate  colinear  distance  from  E  to  the  spectrum  locus 
for  a  color),  but  do  not  indicate  its  perceived  intensity, 
or  brightness. 

Earlier  work  by  Gibson  and  Tyndall  (1923)  and  Coblentz 
and  Emerson  (1917)  had  determined  the  physical  intensities 
of  many  pure  spectral  hues  required  to  match  the  brightness 
of  a  white  stimulus  of  known  physical  intensity.  This 
resulted  in  the  establishment  of  the  relative  luminosity 
function  of  wavelength  (e.g.,  CIE  1924  photopic  luminosity 


function).  The  spectral  energy  distribution  of  any  light 
source  may  be  weighted  by  the  luminosity  function,  then 
integrated  across  its  spectrum,  yielding  the  measure  termed 
luminance  (L).  Tristimulus  space  was  constructed  such  that 
Y  may  be  scaled  to  represent  luminance  (more  will  be  said  of 
the  empirical  relationship  between  brightness  and  luminance 
later).  Y,  x,  and  y  are  defined  to  be  mathematically 
orthogonal  and  may  therefore  be  combined  to  form  a  three- 
dimensional  coordinate  space  comprised  of  a  chromaticity 
diagram  normal  to  a  luminance  axis. 

Uniform  color  scales .  While  tristimulus  space  and  its 
Y,x,y  variant  serve  as  satisfactory  reference  constructs  of 
the  physical  parameters  of  colors,  their  axes  are  not 
perceptually  independent.  Moreover,  they  were  not  designed 
to  be  perceptually  uniform.  Perceptual  uniformity  would  be 
achieved  by  a  space  in  which  a  unit  of  linear  distance, 
regardless  of  its  location  or  orientation  with  respect  to 
the  axes,  represented  an  invariant  measure  of  effective 
color  contrast  or  perceptual  difference.  (The  uniform  space 
sought  in  the  present  research  is  one  in  which  distance 
represents  an  interval  scale  of  perceived  color  contrast 
measured  indirectly  by  RS .  ) 

Since  the  establishment  of  tristimulus  space,  much  work 
has  gone  into  the  adjustment  of  the  geometrical 
relationships  among  its  color  coordinates  in  order  to  render 


(13) 


it  perceptually  uniform  with  regard  to  a  restricted  set  of 
color  vision  phenomena  (i.e.,  color-matching  behavior).  In 
1976,  the  CIE  adopted  the  UCS  (Uniform  Color  Scale)  as  an 
improvement  over  the  x,y  chromaticity  diagram  in  mapping 
color  saturation  data.  The  UCS  Psychometric  Chromaticity 
Coordinates,  u'  and  v'  ,  are  derived  from  CIE  1931 
tristimulus  values  by  the  equations, 

u'  =  4X  /  (X  +  15Y  +  32),  and  (1) 

v'  =  9Y  /  (X  +  15Y  +  32) .  (2) 

Also  in  1976,  L* ,  or  CIE  metric  lightness,  was  accepted  as  a 
uniform  perceptual  scale  of  the  intensity  of  colors  derived 
from  the  Munsell  Color  System  "Value"  dimension,  and  is 

defined  by  the  equation 

L*  =  116( Y  /  Yn)1/3  -  16,  (3) 

(Y  /  Yn)  >  0.01, 

where  Y  is  the  Y-tristimulus  value  of  a  given  color  and  Y 

n 

is  that  of  a  reference  white.  Because  L*  was  developed  for 
surface  colors  (reflective  surfaces),  the  value  of  Y  was 
generally  set  at  the  reflectance  of  white,  which  is 

maximally  100%.  (For  the  present  studies,  Y^  is  arbitrarily 
set  at  100  for  the  chromaticity  coordinates  of  Standard 
I lluminant  C . ) 
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An  early  developed  color  space  is  CIE  1976  (L*,  u*,  v*). 


in  which 


u*  =  13L*  (u'  -  u'  )  ,  and 
\  n/ 


v*  =  13L*  (v1  -  v  )  , 


where  L*  is  metric  lightness,  u'  and  v'  are  the  UCS 


coordinates  of  a  color,  and  u'  and  v'  are  the  coordinates 

n  n 


of  the  reference  white.  L*  is  seen  here  to  effect  a  linear 


projection  of  the  chromatic  space  as  a  function  of  its  own 


value,  while  the  13  corresponds  to  the  13  lightness  levels 


in  the  Munsell  value  scale. 


A  second  color  space  is  CIE  1976  (L*,  a*,  b*  )  ,  in  which 


a*  =  500( (X  /  Xn)1/3  -  (Y  /  Yn)1/3),  and 


b*  =  200 ( ( Y  /  Yn) 


1/3  -  (Z  /  Z)1/3), 


(X  /  Xn),  (Y  /  Yn),  (Z  /  Zn)  >  0.01, 


where  X,  Y,  and  Z  are  the  tristimulus  values  of  a  color  and 


X  ,  Y  ,  and  Z  are  those  of  the  reference  white, 
n  n'  n 


As  seen  above,  metric  lightness  is  the  achromatic 


intensity  scale  employed  with  either  (u*,v*)  or  (a*,b*) 


chromaticity  in  the  CIE  1976  uniform  color  spaces.  These 


two  variants  of  metric  chromaticity  differ  in  that  (u*,v*) 


is  a  linear  projection  of  1931  (X,Y,Z)  space  and  straight 


lines  in  (x,y)  correspond  to  straight  lines  in  (u*,v*)  when 
L*  is  held  constant,  while  (a*,b*)  is  a  curvilinear 
projection  of  (X, Y,Z)  and  straight  lines  in  (x,y)  are 
generally  curved  in  (a*,b*).  In  other  words,  points  of 
constant  dominant  wavelength,  the  physical  correlate  to  hue, 
fall  along  straight  lines  in  (x,y)  or  (u*,v*),  but  along 
curved  lines  in  (a*,b*).  The  existence  of  two  "uniform" 
chromatic  space  designs  stems  from  disparate  findings  in 
basic  work  on  uniform  color- spacing  structures,  or  line 
element  geodesics  (Wyszecki  and  Stiles,  1982).  The  (u*,v*) 
spacings  are  recommended  for  large  chromatic  differences  and 
the  (a*,b*)  spacings  for  small  chromatic  differences,  yet  a 
presumption  of  either  form  of  metric  chromaticity  as  more 
appropriate  for  the  present  research  application  is 
empirically  unfounded. 

A  third  proposed  color  space  is  (W,  a,  b)  (Cohen  and 
Friden,  1975;  1976).  Unlike  the  L*,u*,v*  and  L*,a*,b* 
spaces,  both  transforms  of  the  CIE  1931  tristimulus  space, 
W,a,b  coordinates  may  be  calculated  directly  from  any  valid 
color-matching  functions  (such  as  tristimulus  values)  by  a 
linear  algebraic  matrix  manipulation  procedure.  (See 
Costanza,  1981,  for  complete  computational  details.  W.  W. 
Farley  of  Virginia  Tech  has  derived  the  transform  matrix  for 
the  CIE  1931  2-degree  color-matching  functions  (x,  y,  z) 
required  to  specify  experiment  colors  in  W,  a,  b  space.) 
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Color  contrast  may  be  considered  an  additive  composite 
of  luminance  contrast  and  chrominance  contrast.  Luminance 
contrast  (e.g.,  black-and-white,  or  achromatic)  is  specified 
here  in  terms  of  luminance  modulation  (M), 

M  =  (Lmax  -  *-min)  /  Lmax  +  Lmjn)  (8) 

for  a  given  target/background  (T/B)  combination,  where  Lmax  = 
L-p  and  Lmjn  =  Lg.  Luminance  contrast  may  be  combined  with 
chrominance  contrast  to  create  a  measure  of  overall  color 
contrast  termed  color-difference  (e.g.,  AE)  ,  which  is 
computed  using  a  Pythagorean  distance  formula, 

AE(L#,u#,v*)  =  ((  AL*)2  +  (  Au*)2  +  (  Av*)2)0.5,  (9) 

where  Am*  and  Av*  are  the  differences  (distances)  between 
the  stimulus  pair's  transposed  chromaticity  coordinates  and 
AL*  is  the  difference  between  transposed  luminances  in  the 
CIELUV  space.  Similar  distance  formulas  can  be  calculated 
for  other  color  spaces. 

It  is  desirable  to  control  stimulus  chromaticities  to 
within  one  just  noticeable  difference  (JND)  of  specification 
over  the  duration  of  any  experiment.  In  this  regard,  the 
work  of  MacAdam  (19*12),  in  which  the  CIE  1931  chromaticity 
diagram  was  explored  to  determine  "equal  noticeability 


ellipses"  for  25  chromatici ties,  shown  in  Figure  4,  has  been 

consulted.  Colors  at  given  ellipse  boundaries  were  just 

noticeably  different  (based  on  the  statistical  variability 

of  observer  responses)  from  the  fixed  chromaticity  at  the 

intersection  of  the  ellipses'  axes  for  a  single  observer 

with  normal  color  vision  viewing  a  2-degree  test  spot  at  50 
2 

cd/m  .  The  ellipses  are  plotted  in  (x,y)  coordinates  times 
10  actual  size.  The  JND  values  ranged  from  0.00058  to 
0.00697  (x  or  y)  .  These  are  the  best  available  data  from 
which  to  specify  stability  tolerances  for  stimulus 
chromaticities  for  the  present  research,  yet  the 
attainability  of  such  tolerances  with  the  present 
experimental  apparatus  is  not  readily  testable,  as  discussed 
in  the  Method  section. 

Previous  color  contrast  research ■  The  difference  between 
the  brightness  (e.g.,  phenomenal  intensity)  of  a  target  and 
that  of  its  background  will  determine  to  some  extent  their 
distinctiveness,  or  perceived  contrast.  As  mentioned 
earlier,  however,  existing  color  spaces  only  approximate  the 
expected  brightness  effect  of  colors  through  their  scaling 
of  luminance,  or,  in  the  case  of  W,a,b,  achromatic 
intensity,  providing  poor-to-fair  brightness  metrics.  The 
perceived  brightness  of  a  color  is  due  not  only  to  its 
luminance,  but  also  to  its  chrominance.  Studies  conducted 
by  Booker  (1981)  and  others  have  shown  that,  for  stimuli  of 


equal  luminance,  as  the  purity  of  a  color  increases,  so  does 
its  perceived  brightness.  The  magnitude  of  this  perceptual 
phenomenon  also  varies  with  the  dominant  wavelength  of  a 
stimulus,  being  smallest  for  yellows  and  greatest  for  blues. 

Our  first  goal  in  a  long-term  color  contrast  research 

program  was  to  conceive  an  experimental  procedure  free  of 

the  wavelength-related  brightness  phenomenon.  Subjects 

brightness-matched  seven  CRT  chromaticities  of  dominant 

wavelengths  from  around  the  visible  spectrum  to  35,  50,  and 
2 

70  cd/m  achromatic  stimuli,  resulting  in  three  sets  of 
equally  bright  colors  used  in  subsequent  experiments  (see 
Costanza,  1981).  The  pairing  of  any  two  brightness-matched 
hues  therefore  resulted  in  a  perceived  contrast  attributable 
to  chromatic  differences  alone.  These  chromatic  differences 
were  scaled  in  terms  of  equivalent  achromatic  contrast  by 
having  observers  adjust  the  luminances  of  two  achromatic 
stimuli  until  their  perceived  contrast  equalled  that  of  each 
possible  pair  of  brightness-matched  colors.  A  second  study 
paired  all  the  color  stimuli  rather  than  only  brightness- 
matched  ones.  It  was  reasoned  that  a  color  contrast  metric 
of  high  utility  must  account  for  the  hue-related  brightness 
phenomenon,  and  that  mean  luminance  modulation  settings 
would  correlate  with  color  pair  vector  differences  in  such  a 
uniform  color  space. 


Evaluations  of  the  L*,u*,v*,  L*,a*,b*,  and  W,a,b  color 
spaces  showed  that  36%  to  90%  of  the  variation  in  mean 
modulation  settings  was  accounted  for  by  linear  regression 
models  of  color  contrast,  with  no  evidence  of  higher-order 
effects  (Post,  Costanza,  and  Lippert,  1982).  The  predictive 
power  of  each  metric  was  reduced  in  the  study  pairing  $11 
colors.  A  modified  version  of  L*,u*,v*  yielded  a  superior 
predictor  in  all  cases.  The  findings  suggest  that  a  useful 
transformation  of  color  contrast  into  equivalent  perceived 
achromatic  contrast  might  be  determined  through  further 
study,  and  that  the  equivalent  achromatic  contrast  might  be 
employed  to  specify  the  appropriate  display  of  colors  based 
on  existing  knowledge  of  visual  performance  with  achromatic 
displays.  The  next  research  objective  was  to  study  the 
effects  of  color  contrast  in  a  more  applied  setting. 

Lippert  (1984)  conducted  the  following  visual  performance 
study  (i.  e..  Study  1).  Ten  normal  trichromats'  numeral 
string  reading  speed  (RS)  was  determined  for  achromatic, 
yellow-green,  and  red  dot-matrix  numerals  (T)  presented 
against  eight  spatially  uniform  background  chromaticities 
(B)  on  a  0.5  m  distant  high- resolution  digital  shadowmask 
display  under  low  ambient  illumination.  The  numeral  colors 
were  displayed  at  a  constant  47  cd/m  and  paired  with  each 
background  chromaticity  displayed  at  seven  luminance  levels 
from  23  to  47  cd/m  ,  resulting  in  a  factorial  design  of 


eight  levels  of  chrominance  contrast  by  seven  levels  of 
luminance  modulation  (M),  or  56  color  contrast  conditions, 
for  each  numeral  color.  The  complete  listing  of  Study  1 
stimulus  color  parameters  is  in  Table  4,  Appendix  A.  All 
color  contrast  conditions  were  tested  for  3,  4,  and  5  digit 
random  numeral  strings. 

Performance  with  the  achromatic  and  yellow-green  numerals 
was  similar  overall,  varying  as  a  function  of  chrominance 
contrast  but  being  dominated  by  a  positive  correlation  with 
luminance  contrast.  RS  asymptoted  within  the  luminance 
contrast  range  tested.  The  red  and  purple  backgrounds 
proved  exceptions  in  that  relatively  high  performance  was 
obtained  even  without  luminance  contrast.  The  red  numeral 
chromaticity  overwhelmed  both  chrominance  and  luminance 
contrast  effects,  resulting  in  superior  performance  overall 
and  further  demonstrating  the  potential  for  chromatic 
contrast  alone  as  a  prime  determinant  of  visual  performance. 
The  effect  of  the  number  of  digits  read,  N,  was  shown  to  be 
independent  of  color  contrast  and  linear  over  the  range 
tested . 

Development  of  a  general  color-difference  metric  of 
legibi lity .  As  described  above,  the  first  AE  metrics  of  RS 
examined  were  L*,u*,v*,  L*,a*,b*,  and  Cohen  and  Friden's 
1976  W,a,b.  As  shown  in  Figures  5,  6,  and  7,  which  are 


scattergrams  of  AE  versus  mean  RS  (n 


90)  for  Tach,  Ty_G, 


(23) 


(24  ) 


and  Tred#  respectively,  with  all  of  the  Bs  tested  in  Study 
1,  AE  in  L*,u*,v*  is  essentially  unrelated  to  RS .  A  metric 
of  legibility  based  on  AE  in  L*,u*,v*  is  contra-indicated. 

The  primary  source  of  the  poor  correlations  between  AE 
and  RS  is  the  "convergent"  nature  of  each  of  the  color 
spaces  investigated,  shown  for  an  arbitrary  color  space,  in 
which  any  chromatic  plane  is  seen  edgewise  as  a  horizontal 
line  segment,  and  achromatic  intensity  is  represented  by  the 
vertical  dimension,  as  in  Figure  8.  Points  1  and  2  are 
arbitrary  chromaticities,  representing  a  target  (T)  and  a 
background  (B)  of  equal  luminance  for  purposes  of 
illustration,  and  the  entire  color  gamut  converges  to  the 
black  point  origin,  O.  The  vertical  axis  M  corresponds  to 
the  magnitude  of  luminance  modulation.  In  the  case  of  x 
3^,  as  M  increases,  B^'s  three-dimensional  position  in  the 
space  will  proceed  down  toward  0,  while  the  corresponding 
AE  decreases  from  point  1  to  point  3,  then  increases  beyond 
point  3  to  0.  Because  RS  increases  monotonically  with  M,  a 
color  space  resulting  in  such  a  non-monotonic  computation  of 
AE  must  create  problems,  not  only  in  fitting  RS,  but  also  in 
the  application  of  any  metric  to  specify  color  contrast, 
given  a  desired  performance  level  and  fixed  T/B 
chromaticities,  because  there  will  be  an  M-range  for  many 
T/B  combinations  over  which  a  distance  specification  (e.g., 
required  AE  computed  from  the  metric)  will  have  two 
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Figure  8.  Cross-section  of  an  arbitrary  convergent  color 

space  in  which  chromatic  planes  are  seen  edgewise 
as  horizontal  line  segments. 
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equidistant  loci  in  the  space  along  the  increasing  M 
dimension.  This  insurmountable  property  is  illustrated  in 
Figure  8  by  points  4  and  5,  equidistant  from  point  2.  There 
is  no  logical  approach  to  choosing  between  4  and  5  working 
from  the  metric;  thus,  any  such  space  is  seen  as  invalid  for 
the  purpose  of  color  production  specification. 

Actual  AE  computations  of  Study  1  contrast  combinations 

(Table  4,  Appendix  A)  further  illustrate  the  RS  modeling 

inconsistencies  of  L*,u*,v*  space.  Consider  the  T._rT  x  B 

^  ACH  p 

combination  versus  the  T__  x  B  combination  over  the  range 

RED  p  ^ 

of  M  tested:  at  M  ,  computed  AE  for  T.^tt  x  B  is  119.6, 

.006  ^  ACH  p 

at  M  it  is  113.3,  at  M  it  is  109.7,  then  106.3, 

95.8,  93.0,  and  finally  91.4  at  M  describing  an  inverse 

relationship  where  distance  in  L*,u*,v*  decreases  as  M  (with 
increasing  RS)  increases .  This  is  a  striking  example  of 
poor  RS  modeling  when  one  realizes  that  at  M  with 

distance  still  decreasing,  the  point  in  the  example  which  is 
analogous  to  the  minimum  distance  point  (3)  in  Figure  8  has 
not  yet  been  reached  for  this  T/B  pair.  The  implication  is 
that  minimum  AE  for  T^^  x  Bp  (and  therefore  minimum  RS ) 
occurs  at  an  M  level  higher  than  any  tested  in  Study  1. 

The  reverse  situation  is  seen  to  occur  for  T„__  x  B  over 

RED  p 

M  in  L* , u* , v* :  AE  at  M  is  146.3,  at  M  it  is  151.0, 

at  M  20  it  is  151.5,  then  152.9,  160.1,  161.4,  and  162.1  at 
M  This  direct  relationship  between  AE  and  M  is  a  more 


reasonable  descriptor  of  RS,  although  the  distance  in  this 
case  will  continue  to  increase  beyond  the  M  level  of  maximum 
performance,  suggesting  that  some  non-linear  relationship 
between  RS  and  AE  will  prove  the  best  performance 
descriptor/predictor . 

Obviously,  there  is  an  infinite  number  of  T/B 
combinations  specifiable  within  any  color  space  and,  as 
Figure  8  is  viewed,  it  becomes  clear  that  the  trans- luminous 
range  over  which  AE  decreases  from  the  M  =  0  level  is  a 
complex  function  of  the  slope  of  the  line  of  constant 
chromaticity  for  any  B  and  the  relative  positions  of  both  T 
and  B  with  respect  to  point  2,  or  ,  which  is  normal  to  the 
chromaticity  plane  and  defines  the  neutral  referent 
chromaticity,  which,  in  the  case  of  this  study,  is  Standard 
Illuminant  C. 

The  arbitrary  model  described  above  is  most  similar  to 
L*,u*,v*  space.  While  L*,a*,b*  and  W,a,b  are  somewhat 
different,  their  geometrical  cases  need  not  be  specifically 
addressed  here.  All  are  convergent  color  spaces  and  yield 
similar  inconsistencies  in  computed  AE  with  respect  to  RS . 

A  monotonic  description  of  RS  is  obtained  with  a 
variation  of  the  AE  formulation. 


AE40(Y,u' ,v' )  =  ((AY)2  +  (40  Au')2  +  (40  Av')2)0'5.  (1C 


(29) 


Because  Y,  u',  and  v'  are  physically  as  well  as 
mathematically  orthogonal,  the  Pythagorean  AE  formula  (10) 
defines  a  plane-origin,  or  "non-convergent",  three-space  in 
which  the  CIE  luminance  and  chrominance  constructs  are 
orthogonally  represented  in  all  cases,  unlike  L*,u*,v*  for 
instance,  in  which  u*  and  v*  are  linear  projections  of  u' 
and  v' ,  respectively,  as  a  function  of  L* . 

Various  luminance  and  chrominance  units  were  tested  in 

(10),  indicating  that  CIE  1931  Y  ( cd/m^ )  and  CIE  1976 

(u',v')  UCS  coordinates  provided  the  best  AE  descriptor  of 

RS  (Lippert,  1984).  A  weighting  factor,  40,  shown  in  (10), 

was  statistically  determined  to  adjust  the  relative 

contributions  of  luminance  contrast  and  chrominance  contrast 

component  scales,  which  have  no  a  priori  relationship  to  one 

2 

another,  yielding  a  maximum  linear  r  =  . 64  for  the  Study  1 
data  averaged  across  N  and  excluding  red  and  magenta 
chromatici ties .  T/B  combinations  including  red  or  magenta 
produced  near-maximum  RS  without  luminance  contrast  and  are 
poorly  described  by  this  Y,u',v'  metric.  However,  red 
and/or  magenta  combinations  yield  RS  (e.  g.,  legibility) 

which  is  underpredicted  by  the  metric.  Therefore,  the 
metric  is  expected  to  be  a  very  conservative  predictor  of  RS 
for  combinations  of,  or  with,  red  or  magenta. 

Figure  9  is  the  AE  by  mean  RS  scattergram  for  83  color 
combinations  excluding  red  and  magenta  (n  =  90)  in 


(30) 


Y,u' (40) , v' (40)  space.  A  parabola  comprised  of  AE  and  AEZ 

2 

terms  has  been  fitted  to  the  data  (R  =  .71),  revealing  a 
performance  asymptote  (RSmax  =  20  AE  units  contrast)  and  a 
possible  breakdown  of  the  metric  at  low  AE  based  on  the 
contrast  conditions  sampled.  Such  a  failure  of  the  metric 
is  probably  not  of  practical  concern  because  such  low  AE 
levels  of  contrast  should  never  be  prescribed  for  data 
display,  although  pictorial  imaging  may  require  the  display 
of  luminance  and  chrominance  gradients  of  the  smallest  steps 
attainable  with  a  display  system  in  order  to  achieve  more 
natural  appearing  representations  of  real-world  objects. 

If  possible,  we  wish  to  generalize  the  empirical  modeling 

results  to  the  entire  luminance  range  attainable  with  color 

raster  CRTs.  To  begin  this  process,  a  simplifying 

assumption  has  been  made.  RS  is  assumed  to  be  determined  by 

color  difference  independent  of  absolute  levels  of  photopic 

retinal  stimulation.  For  example,  T/B  contrast  comprised  of 

a  1.5:1  luminance  ratio  and  a  0.2  (u',v')  unit  difference  is 

assumed  to  yield  the  same  RS  for  any  Y,p  if  the  presentation 

polarity  is  positive  or  for  any  Yg  if  the  polarity  is 

negative.  The  luminance  ratio  =  Y  /Y  .  .  Therefore,  the 

max  mm 

scaling  procedure  is  accomplished  in  all  cases  using  Y 

max 

(Y^)  ,  where  Y^  is  the  mean  target  luminance.  It  will  be 
shown  immediately  following  description  of  an  initial 
general  metric  how  the  metric  might  be  adjusted  to  account 


for  variation  in  the  performance  contributions  of  luminance 
contrast  relative  to  chrominance  contrast  as  a  function  of 


The  weighting  factor,  40,  is  specific  to  the  particular 
Ym  in  Study  1,  for  which  averaged  =  45.9  +  0.5  cd/m2 . 
When  applying  this  AE  metric  to  the  more  broad  Y^  continuum, 
the  weighting  factor  must  be  scaled  proportionately  to  the 
difference  between  Y^  =  45.9  cd/m2  and  the  Y^  of  interest  in 
order  to  maintain  the  linear  relationship  between  luminance 
ratio  and  unit  chrominance  difference,  as  RS  determinant 
components  of  color  contrast,  characterized  by  AE  in  (10). 

Scaling  of  the  weighting  factor  may  be  performed  on  a 
"per  color  pair"  basis  for  any  YM  using  the  ratio  of  the 
(linearly  related)  optimal  weighting  factor  and  averaged  Y^s 
from  Study  1, 

(weighting  factor  /  Y^)  =  (40  /  45.9)  =  0.87  =  R,  (11) 

where  R  is  the  assumed  constant  relationship  between 
luminance  contrast  and  chrominance  contrast  components. 
This  results  in  a  Y^-corrected  weighting  factor,  W,  where. 


J 


The  meaning  of  1  unit  of  AE  with  respect  to  RS  is 
specific  to  the  weighting  factor,  40,  and  the  experimental 
Y^s.  However,  if,  as  assumed,  R  is  constant,  it  is  possible 
to  select  any  desired  scaling  of  the  AE  unit  to  provide  the 
most  convenient  color  difference  scale  for  a  given 
application.  This  is  merely  a  standardization  procedure. 


As  stated  above  for  the  Study  1  data  in  Y,u' (40) , v' (40) 
space,  AERSmax4Q  =20.  To  standardize  the  metric,  AERSmax 
was  set  to  equal  100  units  instead  of  20.  Then,  a  T/B  color 
pair  represented  by  50  AE  units  is  predicted  to  yield  50%  of 
RSmax  working  from  the  standardized  space.  This  feature  is 
incorporated  by  scaling  each  of  the  three  AE  components  (AY, 
Au',  and  Av' )  with  an  appropriate  ratio,  S,  determined  by 
the  fixed  relationship  between  Study  l's  weighting  factor, 
40,  and  RSmax  equivalent  AE,  20, 


40/20  =  2 


In  the  resulting  legibility  performance  space,  the 
standardized  weighting  factor  becomes, 


!RSmax  „  ><40  '  2°>  *  100  <2>  =  200  =  Wstd'  and  <14> 
std 


"T TTi- 


rT.'  *.'  r;  w;  w.'^'  j  v 


■■w  v  j  r;  i 


w  .  .  /  w  = 

std 


(15) 


Instead  of  scaling  AE  up  or  down  based  on  Y^,  it  is 
simpler,  starting  with  AY,  Au'  ,  and  Av'  ,  to  scale  any  YM 
directly  to  the  standardized  Y,u',v'  metric  using  the 
equations  already  described  but  shown  now  in  correct  order: 

W  =  0.87  (Ym),  (16) 

S  =  200  /  W,  and  (17) 

AEstd(Y'U' 'V' )  =  ((SAY>2  +  ( SWAu' )2  +  (SWAv’ )2)°-5,  (18) 


or  simply, 

( ( (230/Ym)AY)2  +  (200AU*)2  +  ( 200Av ' ) 2 ) 0 ‘ 5 .  (19) 

Equation  (19)  is  all  that  is  required  to  establish  a 

standardized  color  difference  metric  of  legibility  for  all 

Ym  in  which  AEDC  =  100.  Regression  analyses  performed 

using  this  AE  scale  on  different  data  sets  with  the  same 

dependent  measure  will  result  in  directly  comparable  fa- 

coefficients.  Of  course,  b-coef f icients  from  Y,u',v'  .  , 

std 

must  be  multiplied  by  S  to  return  them  to  their  original 
ef f ect-per-unit  meaning. 

The  Study  1  data  were  transformed  to  Y'u''v'std  wi*-h 
identical  results  to  the  original  40-weighting  regression 

(35) 


v*- 


procedure,  confirming  the  basic  rescaling  relationships 
among  R,  W,  and  S. 

Returning  to  Figure  9,  if  the  RSmax  equivalent  AE,  20,  is 
rescaled  to  100,  and  a  straight  line,  L,  is  drawn  from  (0,0) 
to  the  RS  asymptotic  point,  P,  then  the  AE  scale  becomes  a 
"percentage  improvement  in  legibility"  predictor  if  F  falls 
on  L.  The  AE  to  RS  relationship  described  here  only 
approximates  an  interval  one.  Yet  it  is  clear  from  Figure  9 
that  performance  prediction  by  this  method  with 
AEstd(Y,u' , v' )  is  statistically  most  accurate  at  AERSmax  and 
increasingly  conservative  with  respect  to  ensuring  legible 
contrast  as  AE  decreases.  Because  Figure  9  is  a  plot  of 
means,  such  a  performance  underprediction  approach  appears 
warranted. 

Consider  now  the  possibility  that  R  does  not  equal  0.87Y^ 
for  some  Y^,  that  is,  that  the  relative  contributions  to 
legibility  of  the  luminance  contrast  and  chrominance 
contrast  unit  components  vary  as  a  function  of  absolute 
levels  of  retinal  stimulation.  If  it  is  empirically 
determined  that  the  legibility  contribution  of  chrominance 
contrast  increases  linearly  with  Y^,  then, 

R  =  (0.87Ym)  (Cy  ),  (20) 

M 


(  36) 


where  C  is  the  linear  coefficient  describing  the  increasing 

contribution  of  chrominance  contrast  as  a  function  of  Y„ 

M 

and. 


c  < 

1 

for 

(0.87Ym) 

A 

O 

(21) 

c  = 

1 

for 

(0.87Ym) 

=  40  and. 

(22) 

c  > 

1 

for 

(0.87Ym) 

o' 

A 

(23) 

where  (C  =  1  for  (0.87Y^)  =  40)  has  been  determined  for  the 
Study  1  data. 

If  future  evidence  suggests  C  is  non-linear  with  respect 
to  some  range  of  the  Y^  continuum,  then  C  becomes  the  sum  of 
the  coefficients  describing  the  non-linear  relationship  as  a 
function  of  Y^  over  that  range.  This  scheme  provides 
considerable  modeling  flexibility.  For  instance,  C  might  be 
an  inverted  "U"  function  of  Y^,  a  higher-order  function,  or 
even  be  described  by  a  discontinuity,  perhaps  at  a 
transition  level  of  Y^  from  predominant  cone  to  mesopic 
vision . 

Chrominance  axes  rescaling.  Thus  far,  the  u'  and  v' 
scales  have  been  employed  in  the  AE  metrics.  By 
independently,  linearly  rescaling  the  u'  and  v'  axes 
relative  to  one  another  and  determining  the  appropriate 
weighting  factor  relative  to  Y  (i.  e.,  independently. 


2 

linearly  rescaling  Y)  to  achieve  maximum  r  A£/  a  metric  has 
been  achieved  which  describes  the  entire  data  set  from  Study 
1  including  the  red  and  magenta  chromaticities  (Figure  10). 
The  results  of  this  process  are  described  more  fully  by 
Lippert  ( 1984 ) . 

Figure  10  illustrates  a  strong  relationship  between 
2  2 

combined  AE  and  AE  terms  and  RS  (parabola  R  =  0.71).  It 
is  this  type  of  fully  rescaled  metric  with  which  the  balance 
of  this  report  will  be  concerned.  In  addition  to  Y,  the 
l°gio(Y)<  (e.  g.  ,  logY)  and  L*  luminance  scalings  were 
considered  as  metric  components.  Figures  11  and  12 
illustrate  the  similar  RS  modeling  potentials  of  all  three 
luminance  scalings  as  combined  with  2.2:1  rescaled  (u':v') 
chromatic! ty  axes. 

It  was  shown  earlier  in  this  section  that  the  convergent 
L*,u*,v*  space  does  not  yield  a  AE  scale  which  describes  RS 
well,  nor  does  it  always  provide  a  unique  color  pair 
solution  given  one  color  specification  and  a  required  AE  for 
a  desired  level  of  legibility.  However,  by  rescaling  the 
L*,  u*,  and  v*  axes  to  fit  the  Study  1  data  as  well  as 
possible,  an  RS  description  similar  to  those  of  the  non- 
convergent  spaces  is  obtained.  This  AE  to  RS  relationship 
is  shown  by  Figure  13,  which  appears  very  similar  to  Figures 
10-12.  It  is  important  to  note  that  the  convergent  nature 
of  L*,u*,v*  space  is  not  completely  eliminated  by  the 

(38) 


AE(L*/u',v')  by  RS  scattergram 
rescaled  chrominance  axes. 
Study  1  (Lippert,  1984). 


rescaling  procedure.  Therefore,  it  will  in  some  cases 
provide  similar,  although  not  unique,  solutions. 

Table  2,  RESULTS,  contains  a  listing  of  AE  component 
weightings  and  regression  coefficients  and  Table  3,  RESULTS, 
lists  the  AE  component  weightings  and  the  lluminance- 
generalized,  AE-standardized  expressions  for  the  Y,u',v', 
logY,u',v',  L*,u',v',  and  L*,u*,v*  metrics. 

Luminance-generalization  and  AE-standardization  of  the 
(Y,u',v')  metric  with  independently,  linearly  rescaled  axes 
is  based  on  the  complete  Study  1  stimulus/response  set  as 
follows : 


2 

Ym  =  46.6  cd/m  (averaged  across  all  Ts),  (24) 

weighting^  =  110,  and  (25) 

weighting.^,  =  50.  (26) 

Factoring  50  from  the  weightings. 


weighting^ , 

=  50  ( 

2.2  u*  ), 

and 

(27) 

weighting^ 

=  50  ( 

1.0  v' ); 

therefore , 

(28) 

weighting 

=  50 

(29) 

for  both  chrominance  components  (2.2  remains  with  u'). 


'  i 

J 


*» 


** 


£ 


S! 


si 

zi 


.•V/., 


V "/  V  .•  ' 


(42) 


R  =  weighting  /  =  50  /  46.6  =  1.073  and,  (30) 


W  =  1 . 073Y  . 

M 


Now,  from  Figure  10, 


AEnc,  =  30, 
RSmax 


corresponding  to  an  RSmax  =  2.37  compared  to  2.17  for  the 
metric  excluding  red  and  magenta.  The  red  and  magenta 


inclusive  metric  requires  greate 


'dieted  RS  for  100% 


legibility  and  is  therefore  sn  Lj  more  conservative. 

Now,  the  relationship  between  the  weighting  factor  and 

AE_,_  is, 

RSmax 


50  /  30  =  1.6667. 


Selecting  AERSmax  =  100, 


Wstd  =  (  100  )  (  1-667  )  =  166.7,  and 


S  =  W  .  ,  /  W  =  166.7  /  1.073Y„  =  155.3 

std.  '  '  M 


Completing  the  process  via  (18), 


AE  =  (((155  /  Ym)  AY)2  +  ( 367Au ' ) 2  +  ( 167Av ' ) 2 ) ° ' 5 .  (36; 


For  a  logY  metric,  the  weightings  are  1.43  (  Au  ' )  and 

0.65(Av'),  resulting  in  the  final  expression, 

AE  =  C(250/logYM)(log(YT/YB))2  +  (7.67  Au')2 

+  (3.49  Av')2]°.5,  (37) 

for  the  L*  version,  the  weightings  are:  7 9 C  Au')  and 
3  6 (  Av  ' ) ,  resulting  in, 

AE  =  [((260/L*m)  AL*)2  +  (440  au')2  +  ( 200  Av  • ) 2  ] 0  •  5  .  (38) 

The  weightings  for  L*,u*,v*  are  1.0  (AL*),  0.00583(  au*) , 

and  0.00278C  Av*),  resulting  in 

AE  =  C((595/L«M)  AL*)2  +  ( 0 .75  Au*  )2  +  (0.36  Av*)2]0.5  (39) 

Equations  (36),  (37),  (38),  and  (39)  are  the  metrics  of 
particular  interest  to  this  report.  They  do  not  include  any 
task  effect  other  than  that  of  color  contrast  and  are 
designed  for  use  as  relative  indices  of  legibility. 

Figure  14  is  a  scattergram  of  luminance-generalized ,  AE- 
standardized  AE(Y,u' ,v' )  versus  RS  from  Study  1.  Except  for 
an  insignificant  amount  of  numerical  disagreement  due  to  the 
different  AE  computational  procedures  used  for  Figure  14 
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versus  Figure  10,  the  standardized  formula  provides  the  same 
AE-to-RS  relationship  as  does  the  non- standardized  version. 
The  AE  scale  in  Figure  14  relates  RS  to  AE  =  100  and,  if 
used  as  a  percent  improvement  in  legibility  scale,  provides 
underestimations  of  the  observed  RS  means  below  RS 


Research  Objectives 


Study  1  determined  that  a  non-convergent  AE  space  is 

appropriate  for  RS  modeling.  It  provided  a  large  data  set 

with  which  to  develop  AE  metrics  but  tested  only  positive 

2 

presentation  polarities  and  only  at  a  single  =  46.6  cd/m 

(averaged)  across  Ts.  Study  2  questions  the  validity  of  the 

assumption  that  the  effect  of  AE  on  RS  is  independent  of  the 

absolute  level  of  retinal  stimulation,  that  is,  that  R  is  a 

2 

parametric  constant,  by  employing  a  YT  =  20  cd/m  ,  and 

considers  the  application  of  any  single  metric  to  both 
positive  and  negative  presentation  polarities. 


The  luminance-generalized,  AE-standardized  metrics  are 
employed  in  Study  2  to  determine  the  merit  of  the  assumption 
that  R  is  constant.  To  the  extent  that  the  metrics  for  the 
positive  presentation  polarities  in  Study  1  adequately 
describe  RS  in  Study  2,  the  method  of  legibility  prediction 
at  low  contrast  levels  using  AE  would  be  confirmed  and 
generalized . 


METHOD 


Digital  Color  Video  System 

The  display/radiometric  system  used  in  this  research  is 
actually  an  evolved  combination  of  computer,  refresh-type 
digital  image  processor,  CRT,  radiometric  measurement,  and 
laboratory  interconnection  systems.  It  is  capable  of 
displaying  color  images  within  the  design  limits  of  its 
components,  scanning  images  spectroradiometrically,  and 
reducing  the  resultant  data  to  photometric  parameters.  The 
measurement  subsystem  is  calibrated  to  a  National  Bureau  of 
Standards-traceable  spectral  radiance  source  (approximately 
standard  illuminant  A),  allowing  the  user  to  develop, 
display,  verify,  and  report  the  power  spectra  of  color 
images.  A  technical  report  describing  the  system's 
functional  requirements  has  been  published  (Farley  and 
Gutmann,  1980).  This  subsection  addresses  the  preparation 
of  the  laboratory  system  for  this  particular 
experimentation. 

Monitor .  An  Aydin  Model  8025  19-inch  diagonal  high- 
resolution  color  monitor  serves  as  the  display.  It  employs 
a  delta-gun  (R,  G,  and  B)  Mitsubishi  shadow  mask  CRT 
incorporating  P22  phosphors  and  a  0.29  mm  triad  pitch.  The 
raster  is  2:1  positively  interlaced  and  the  normal  aspect 


ratio  is  adjusted  to  1:1  (square  active  display).  The 
monitor  is  driven  by  an  IIS  Model  70  digital  refresh  image 
processor  (RS-170  standard),  which  delivers  512  x  512 
picture  element  (pixel)  images  to  each  of  the  CRT  guns  for 
the  square  trichromatic  display.  Refresh  rate  is  60 
fields/s  (30  frames/s). 

CRT  convergence.  Visual  inspection  of  variously  colored 
numerals  and  backgrounds  indicates  that  normal  CRT  dynamic 
convergence  adjustment  procedures  using  a  cross-hatch 
pattern  are  suboptimal.  The  slightest  misconvergence 
results  in  visible  dark  outlines  for  the  numerals,  outlines 
which  are  unwanted  display  artifacts  that  increase  perceived 
numeral/background  contrast  beyond  that  produced  by  the 
photometrically  controlled  stimulus  parameters.  It  was 
decided  to  adjust  the  dynamic  convergence  at  each 
experimental  task  position  of  the  screen  while  actual 
experimental  images  were  being  displayed.  The  procedure  was 
to  adjust  for  minimum  perceived  contrast,  which  is  obtained 
with  perfect  convergence,  but  which  is  not  obtainable  on  the 
monitor  employed  over  the  entire  area  of  any  task  location, 
much  less  over  the  entire  screen.  For  this  reason  as  well 
as  the  spectral  radiance  nonuniformities  discussed  below, 
the  accuracy  of  spatially-dependent  displayed  color 
specifications  remains  somewhat  in  question.  Observers 
noted  that  the  numerals  are  less  easily  read  given  the  above 


adjustment  procedure,  especially  at  low  contrast  levels. 
Black  and  white  facsimiles  of  best  and  poor  attainable 
convergence  for  red  numerals  on  an  achromatic  background  are 


shown  in  Figure  15,  a  and  b,  respectively.  The  "poor" 
example  is  within  the  manufacturer's  specifications, 
indicating  that  a  global  adjustment  strategy  is  not 
sufficient  when  local  convergence  is  critical. 

Raster  delay.  A  response  trigger  was  used  by  subjects  to 
begin  and  end  experimental  trials  while  automatically 
recording  response  time  as  measured  by  a  1-ms  resolution 
clock.  A  lag  time  was  specified  for  the  display  of  task 
numerals  once  a  trial  was  initiated.  For  task  1  (e.g.,  top 
center  3-digits)  lag  time  is  negligible.  For  task  2  (e.g., 
left  center  4-digits)  and  task  3  (e.g.,  right  center 
5-digits)  it  is  8  ms.  This  measure  of  lag  time  assumes  the 
clock  and  first  HUD  field  begin  at  the  end  of  the  raster's 
vertical  retrace  following  trigger  depression  and  is 
linearly  related  to  the  number  of  lines  below  raster-line  1 
of  the  numerals'  bottom  line  (first  field).  When  the 
trigger  is  released,  the  clock  is  stopped  and  an  inter¬ 
stimulus  image  ( I S I )  is  displayed.  End-of-trial  functions 
occur  within  approximately  1  ms  of  subject  response. 

However,  there  is  an  inaccuracy  inherent  in  this 
procedure.  It  is  due  to  the  interval  between  the  final 
refresh  of  the  numerals  (ending  a  trial)  and  the  first  paint 
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of  the  ISI  over  the  task  numeral  location.  The  P22 
phosphors'  persistences  range  from  about  22  microseconds  (B) 
to  1  ms  (R)/  presenting  no  significant  problem,  but  the 
retinal  persistence  of  the  task  numeral  images  may  be  great 
enough  to  last  until  re-stimulation  by  the  ISI  on  the 
following  raster.  This  would  introduce  a  systematic 
increase  in  the  time  the  numerals  are  visible  beyond  that 
recorded  by  the  clock.  It  is  a  concern  that  response 
correctness  might  be  inflated,  because  the  observer  could 
continue  to  process  task  information  after  the  end-trial 
response.  Because  subjects  were  instructed  to  perform  as 
quickly  and  accurately  as  possible  and  because  it  is  assumed 
they  will  not  end  the  trial  before  extracting  the  requested 
information,  the  response  time  measure,  which  is  accurate  to 
within  approximately  2  ms  once  corrected  through  subtraction 
of  the  fixed  image  onset  delay  of  8  ms  (Tasks  2  and  3  only) 
via  software,  is  considered  of  primary  importance. 

Radiometric  measurement  system.  A  subsystem  very 
critical  to  experimental  control  is  the  radiometric 
measurement  system  (MS)  described  in  block  diagram  form  by 
Figure  16.  The  MS  is  a  modified  Gamma  Scientific,  Inc., 
visible  band  system.  To  determine  the  power  spectrum  of  a 
source  of  light,  some  of  its  emission  is  collected  by  a 
fibre-optic  cable  and  fed  to  a  monochrometer  which  optically 
processes  that  energy  into  a  spectrum  of  known  location  in 


space.  The  spectrum  is  sampled  (scanned)  over  a  specified 

range  (380  to  760  nm  in  5.0-nm  increments  for  this  study)  (J 

and  the  narrow  bandwidth  samples  (half-power  bandpass  of  the 

$ 

monochrometer  is  5.0  nm)  are  amplified  by  a  photo-multiplier  'jfl 


tube  (PMT) .  The  sample  values  are  stored  in  computer  files 


(scan  files)  which  contain  radiance  values  as  a  function  of 
wavelength.  Photometric  values  are  computed  directly  from 
the  radiance  files  via  software. 

Calibration.  It  is  necessary  to  calibrate  the  MS 
relative  to  a  full-spectrum  standard  light  source,  the  power 
spectrum  of  which  is  known  and  is  stored  in  the  computer. 
The  amplified  radiance  samples  collected  from  this  source 
are  sent  to  the  computer  in  the  form  of  analog  voltages 
which  are  digitized.  Because  each  sample's  wavelength  is 
known,  its  corresponding  absolute  radiance  value,  stored  in 
the  computer,  is  simply  divided  by  the  sample  value  to 
create  correcting,  or  scaling,  factors.  Scaling  factors 
permit  the  accurate  characterization  of  unknown  spectral 
distributions  even  though  the  gain  and  linearity  of  the  MS 
(although  not  the  absolute  output  of  the  radiance  source) 
are  expected  to  vary  with  time. 

A  source  of  unknown  spectral  distribution  (a  CRT  image) 
is  sampled  at  the  same  wavelengths  as  is  the  standard  light 
source.  The  unknown's  sampled  digital  values  are  multiplied 
by  the  corresponding  scaling  factors,  yielding  a  relative 
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spectroradiometric  characterization.  This  scanfile  may  be 
weighted  by  the  1931  C.I.E.  2-degree  color-matching 
functions  and  integrated  across  the  spectrum  to  yield 
tristimulus  values  and,  therefore,  any  desired  photometric 
transforms . 

A  new  spectral  radiance  standard  utilizing  a  General 
Electric  DYT  19-volt  rated  quartz  tungsten-halogen 
projection  lamp  has  been  developed  in-house.  A  90-min 
repeated-scan  procedure  indicated  the  combined  radiance 
source/MS  drift  to  be  on  the  order  of  0.5  %  in  luminance  and 
0.0002  in  either  CIE  x  or  y. 

The  radiometric  measurement  techniques  employed  assume 
that  a  uniform  light  source  is  being  scanned.  For  the  CRT, 
radiometric  measurements  exhibit  no  appreciable  change  as  a 
function  of  screen- to-collector  distance  or  incidence  angle 
(within  reasonable  limits).  In  other  words,  the  shadow  mask 
display  behaves  much  like  a  cosine  radiator  and  is 
appropriately  measured  using  our  current  procedure.  The  DYT 
radiance  source  was  calibrated  by  Hoffman  Engineering 
Corporation  by  sampling  only  the  central  1.27  cm  spot  on  its 
diffuser.  This  procedure  is  matched  and  maintained  for  all 


MS  calibrations. 


Sources  of  error.  There  are  two  major  sources  of  error 
associated  with  the  display/radiometric  measurement  system. 
The  first  is  the  MS  itself.  The  PMT  is  particularly  prone  to 
signal-to-noise  and  drift  problems,  which  are  controlled  by 
averaging  multiple  samples  at  each  wavelength  and  frequent  MS 
calibration . 

The  second  source  of  error,  the  output  stability  of  the 
monitor,  can,  in  general,  only  be  recorded  for  post¬ 
experiment  adjustment  of  stimulus  parameter  specifications, 
or  it  may  simply  be  accepted  as  a  source  of  noise  in 
experiment  response  data.  The  reported  color  parameters  for 
this  study  were  determined  at  the  midpoint  of  the  data 
collection  sequence.  R,  G,  and  B  command  values  were  not 
adjusted  during  data  collection  sessions. 

Monitor  characterization  requires  that  each  gun  (R,  G, 
and  B)  be  scanned  throughout  its  available  luminance  range 
using  the  calibrated  MS.  This  results  in  bits-to- 
luminance  (computer  command- to-display  output)  functions  for 
each  gun.  A  computerized  algorithm  searches  look-up  tables 
of  1931  CIE  tristimulus  values  derived  from  the  sampled 
points  along  each  function  which  are  additively  combined  and 
tested  to  provide  R,  G,  and  B  bit-values  for  any  desired  CRT- 


attainable  stimulus  color. 


System  stability.  As  explained  earlier,  the  luminance 
and  chrominance  of  displayed  colors  should  be  controlled  to 
within  1  JND.  Repeated  pre-experiment  scans  of  the  monitor 
indicate  an  overall  measurement/display  system  stability. 
The  luminance  instability  was  determined  through  nightly 
scanning  of  all  experimental  stimulus  colors  over  the  8-day 
data  collection  period  and  equalled  less  than  1.5  %  of  any 
of  the  stimulus  luminance  specifications  in  Table  4, 
Appendix  A. 

System  baseline  tests .  A  set  of  baseline  tests  was 
developed  to  evaluate  the  eleuLiical  and  mechanical 
integrity  of  the  display/radiometric  system  and  provide  a 
means  of  detecting  changes  in  system  component  performance. 
These  tests  were  performed  before  monitor  characterization 
for  this  study. 

Experiment  control .  The  laboratory's  PDP  11/55 
minicomputer  and  associated  networks  were  used  for  real-time 
control  of  all  display/radiometric  measurement  system 
operations  for  this  experiment,  including  data  collection 
and  storage.  Data  were  transferred  to  the  Virginia  Tech  IBM 
370  for  analyses  using  the  Statistical  Analysis  System. 


"  »  ^  -v.  I“-  "V.  -  ~W  • 


Stimuli 

Active  di splay .  All  stimuli  were  presented  on  the  Aydin 
monitor  in  a  central  26  cm  x  26  cm  area  defining  the 
background  display.  The  512  x  512  pixel  image  thus  contains 
pixels  equally  spaced  horizontally  and  vertically, 
facilitating  programming  of  the  7x9  dot-matrix  Huddleston 
font  numerals  which  were  designed  for  maximum  legibility  in 
high  ambient  illumination  environments  (Snyder,  1980).  At  a 
viewing  distance  of  0.76  m,  the  active  display  subtended  20 
visual  degrees  horizontally  and  vertically,  resulting  in  a 
limiting  display  resolution  of  approximately  13 
cycles/degree.  The  0.76  m  viewing  distance  in  Study  2  was 
increased  from  the  0.5  m  viewing  distance  in  Study  1.  This 
was  done  to  determine  whether  this  limited  range  of  viewing 
distance  had  a  marked  effect  on  the  ability  of  Study  1 
metrics  to  describe  RS  in  Study  2.  Because  visual 
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Numeral  strings  and  background.  On  a  given  trial  color 
contrast  was  accomplished  by  displaying  a  HUD  and  background 
of  known  color  parameters.  The  task  numerals  are  completely 
surrounded  by  a  background  field  of  desired  parameters. 
While  each  of  the  10  background  fields  is  independently 
addressed,  only  three  of  the  fields  surround  task  numerals. 
Other  fields  were  of  color  parameters  randomly  selected  from 
the  entire  pool  of  experimental  background  colors. 

The  nominal  HUD  and  background  chromatici ties  selected 
for  Study  2  are  depicted  on  the  CIE  1931  (x, y)  chromaticity 
diagram  by  Figure  18,  and  the  measured  stimulus  color 
parameters  are  listed  in  Table  5,  Appendix  A.  Included  are 
achromatic  (ACH),  yellow-green  (Y-G),  and  red  (RED)  Ts  and 
purple  (pl,p2),  magenta  (ml, m2),  red  (rl,r2),  green  (g),  and 
achromatic  (a)  Bs.  The  vertices  of  the  triangle  inset 
represent  the  chromaticities  of  the  Mitsubishi  CRT  P22 
phosphors . 


& 


‘m* 

k! 


> . 
4  * 


•7 

v 

v 


£ 


»  A 

3 


v! 

v 


2 

Interstimulus  image  ( ISI ) .  A  2  pixel  black-and-white 

checkerboard  filled  the  active  display  area  between  trials. 

The  white  squares  were  displayed  at  the  constant 

2 

experimental  numeral  luminance  (20  cd/m  ).  The  intended 
functions  of  the  ISI  were  to  maintain  the  luminance 
adaptation  state  of  subjects  between  trials,  minimize 
successive  chromatic  adaptation  confounds  (Wyszecki  and 
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Stiles,  1967,  p.  235),  provide  interference  with  any  retinal 
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persistence  of  the  task  numeral  images,  and  maximize  the 
trial-to-trial  luminance  output  stability  of  the  monitor. 


Experimental  Design 

The  study  investigated  reading  speed  (RS)  as  a  function 
of  color  contrast  operationally  defined  in  terms  of  (1) 
numeral,  or  target  (T)  chromaticity,  (2)  background  (B) 
chromaticity,  and  (3)  target- to-background  (T/B)  luminance 
modulation  (M) . 

Luminance  and  chrominance  contrast .  The  attainability  of 
desired  color  parameters  is  limited  by  the  phosphors  and 
operating  characteristics  of  the  CRT  and  the  digital 
resolution  of  the  image  processor  employed,  and  varies  from 
maximum  flexibility  at  low  purities  to  greatly  reduced 
flexibility  at  high  purities.  Parameter  maintainability  is 
an  expression  of  the  display  system  stability,  as  specified 
previously.  Scan-to-scan  variability  in  stimulus  L  measures 
indicated  a  maintainability  of  +  2  %  over  a  two  week  period. 
For  this  reason,  no  problem  existed  in  maintaining  T  or  B 
luminances  to  achieve  linear  steps  in  M  of  approximately 
0.09. 

Experimental  manipulations .  The  experimental  design 
consisted  of  seven  levels  of  M  combined  factorially  with 
eight  levels  of  B,  resulting  in  56  color  contrast  (T/B) 


combinations  for  each  of  the  three  Ts  in  each  of  three 


reading  tasks,  3,  4,  and  5  digits  (N) .  Figure  19  shows  the 

nominal  B  x  M  x  N  stimulus  matrix  for  each  T.  Note  that 

luminance  modulation  may  take  on  positive  or  negative  values 

as  defined  for  Study  2  (M  =  (LT  -  Lg )/( LT  +  Lg)), 

corresponding  to  positive  presentation  polarities  (i.  e., 

lighter  characters  on  a  darker  background)  and  negative 

polarities,  respectively.  The  Study  1  experimental  design 

matrix  differs  from  that  for  Study  2  only  in  that  different 

levels  of  the  T,  B,  and  M  dimensions  were  employed  (Table  4, 

Appendix  A)  .  As  stated  previously.  Study  1  concentrated  on 

2 

positive  presentation  polarity  between  46  cd/m  numerals  and 

a  parametric  selection  of  chromaticies  while  Study  2  used  20 
2 

cd/m  numerals  in  both  positive  and  negative  presentation 

polarities  with  a  predominance  of  chromaticities  from  the 

red,  magenta,  and  purple  chromaticity  sectors.  In  both 

studies,  the  white  T  (ACH)  had  only  55  T/B  combinations 

because  it  alone  was  nearly  identical  chromatically  to  the 

achromatic  B.  The  lowest  MD  stimulus  condition  for  T.„„ 

a 

was  therefore  eliminated  because  there  was  essentially  no 
luminance  or  chrominance  contrast  to  distinguish  T  from  B. 

The  experiment  consisted  of  four  replications  of  the  T  x 
B  x  M  x  N  design  for  each  of  six  subjects,  totalling  12,024 
trials.  The  first  replication  was  used  as  a  training 
exercise  while  replications  2-4  constituted  the  9018 
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experimental  observations  acquired  for  the  present  analysis. 
Total  trial  time  averaged  about  10  s/trial,  resulting  in  33 
hours  of  data  collection. 


Subjects 

Six  volunteers  (three  females)  from  the  Virginia 
Polytechnic  Institute  and  State  University  student  pool 
served  as  subjects.  All  were  screened  for  normal  color 
vision  using  Dvorine  Pseudo- Isochromatic  Color  Plates  (there 
were  no  misses)  and  for  20/20  uncorrected  near  and  far 
visual  acuity  using  a  Bausch  &  Lomb  Orthorater.  Upon 
completion  of  all  experimental  sessions,  the  experimenter 
debriefed  each  subject,  offered  a  summary  of  the  results  at 
a  future  date,  and  paid  each  subject  20  dollars. 

Procedure 

Screened  subjects  participated  in  one  60-min  session 
every  day  (completing  one  replication  per  session).  The 
instructions,  to  read  as  quickly  and  accurately  as  possible 
one  specified  numeral  string  per  trial,  were  read  to  each 
subject  at  the  first  session.  At  each  session,  a  subject 
was  presented  the  ISI  for  three  to  five  minutes,  allowing 
his/her  visual  system  to  adapt  to  the  moderate  luminance  of 
the  display.  The  sessions  were  divided  into  three 
subsessions,  each  equalling  one-third  of  one  replication  of 
the  study,  with  a  short  rest  period  between  subsessions. 
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Each  subsession  consisted  of  167  trials  in  three  blocks. 
Each  block  was  of  one  T,  and  the  presentation  order  of  Ts 
was  randomized  with  the  constraint  that  each  of  the  three  T- 
blocks  began  one  subsession  in  each  session.  Within  blocks, 
the  B  x  M  x  N  combinations  were  randomized  with  the 
constraint  that  a  complete  replication  be  presented  within 
one  session. 

Figure  20  is  a  photograph  of  a  subject  seated  in  typical 
posture  at  the  subject  station.  A  forehead  rest  maintained 
a  centered  active-field  subtense  of  20  degrees.  Not  shown 
is  a  black  surround,  subtending  approximately  55  vertical 
and  80  horizontal  visual  degrees,  which  was  fitted  over  the 
active  display  to  mask  the  monitor  casing.  The  experiment 
was  conducted  without  room  illumination  to  control  for 
possible  chromatic  adaptation  effects  of  the  experiment  room 
walls.  A  lightly  spring  loaded  momentary  solid  state  switch 
set  into  a  Dan  Wesson  finger-grooved  pistol  grip  was  used  by 
subjects  as  an  index  finger  trigger  for  the  display.  The 
grip  was  mounted  for  comfortable  usage  with  a  15-degree 
forward  tilt  and  a  removable  wooden  block  for  resting  the 
heel  of  the  hand  if  desired.  This  assembly  is  free¬ 
standing,  may  be  positioned  by  subjects  for  maximum  ease  of 
use,  and  could  be  used  with  either  hand.  Subjects  typically 
leaned  into  the  forehead  rest  as  in  Figure  20,  but  a  padded 
lumbar  support  was  provided  for  those  who  wished  to  sit 
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upright.  The  padded  seat  was  adjusted  in  height  for  each 
subject  to  match  eye-height  to  center-screen  height.  A 
foot-rest  was  furnished  for  one  subject  requiring  it.  The 
experimenter  controlled  each  session  seated  at  a  VDT  behind 
the  subject  station. 

Each  trial  began  when  the  experimenter  informed  the 
subject  of  the  reading  task  for  that  trial  (i.  e.,  airspeed, 
heading,  or  altitude).  The  experimenter  received  this 
information  from  the  VDT.  When  ready,  the  subject  initiated 
the  trial  by  triggering  the  solid  state  switch  and  holding 
it  in.  When  the  requested  information  was  extracted  from 
the  display,  the  subject  released  the  trigger,  stopping  the 
response-time  clock  (automatically  recording  and  indexing 
elapsed  time  for  the  trial)  and  replacing  the  experimental 
field  with  the  ISI.  The  subject  then  verbalized  the  digits 
read  for  the  experimenter,  who  checked  the  verbalization 
against  the  correct  numeral  sequence  and  kept  a  cumulative 
error  count  by  hand.  Error  rate  in  Study  2  is  as  in  Study  1 
and  in  Post's  (1983)  experiment,  about  2  percent.  For 
correct  responses,  the  experimenter  struck  the  return  key 
or,  some  number  of  zeros  (erroneous  digits)  was  entered 
followed  by  the  return  key,  which  always  signaled  the 
computer  to  begin  the  next  trial.  The  computer  then 
prompted  the  experimenter  who  in  turn  prompted  the  subject 
for  the  next  trial,  etc.  Care  was  taken  not  to  force  any 


subject  into  a  minimum  trial-time  mental  set.  Rather,  the 
emphasis  was  on  relaxed  attention  to  the  display  for  subject 
initiation  of  each  trial. 


RESULTS 


The  present  study  (2)  was  conducted  to  determine  the 
generalizabi lity  of  any  metrics  developed  from  the  Study  1 
data.  The  success  of  four  Study  1  metrics  in  describing  the 
Study  1  data  assists  researchers  of  this  type  of  performance 
because  it  precludes  the  analysis  of  variance  procedures 
used  in  Study  1  to  determine  levels  of  performance 
associated  with  levels  of  color  contrast.  It  has  been 
demonstrated  that  an  appropriate  continuous  scale  of  AE  may 
be  applied  to  the  description  of  RS.  The  analysis  of  Study 
2  therefore  reduces  to  least-squares  regressions  of  AE  from 
Study  1  metrics  with  RS  from  Study  2. 

CIE  1931  (X,Y,Z)  tristimulus  values,  1976  (u',v') 
coordinates,  L* ,  and  log1Q(Y)  were  computed  for  each  T  and 
B.  The  167  stimulus  color  combinations  for  each  study  were 
transformed  into  several  AE  scales  based  on  the  above 
luminance  components,  each  combined  with  rescaled  (e.  g. , 
2.2:1  u':v')  chromaticity  coordinates  as  described  in  the 
INTRODUCTION.  In  addition,  AE(L*,u*,v*)  was  computed. 
These  continuous  one-dimensional  variable  scales  were 
evaluated  as  predictors  of  RS  via  regression  analysis. 

The  regression  coefficients  of  the  rescaled  Study  1 
metrics  are  summarized  in  Table  2  for  the  RS  data  sets  from 
Study  1  and  Study  2.  The  Statistical  Analysis  System  GLM 
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Table  2.  AE  Component  Weightings  and  Regression 
Coefficients  for  the  Rescaled  Metrics 


RS  Data  Set 


Study  1 

Parameter  b-Coef f icient* 


Study  2 
b-coef f icient* 


AE(Y/U' , V* ) : 

.923 

.951 

Intercept 

1.7087 

1.6522 

N 

-0.3980 

-0.3655 

AE 

0.1598 

0.1631 

AC2 

AE 

-0.0028 

-0.0025 

AE ( logY, u ' , v 

.904 

.929 

Intercept 

1.8205 

1.7081 

N 

-0.3980 

-0.3655 

AE 

12.1732 

12.5470 

AE2 

-17.0113 

-16.6709 

AE ( L* , u ' ,  V '  ) 

: 

.914 

.946 

Intercept 

1.7923 

1.7343 

N 

-0.3980 

-0.3655 

AE 

0.2087 

0.1909 

AE2 

-0.0050 
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Table  2,  continued 


AE(L*,u*,v*) : 
Intercept 
N 
AE 
AE2 


.909 


1.8581 

-0.3980 

0.1217 


1.8022 

-0.3655 

0.1206 


-0.0019 


-0.0017 
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procedure  Student's  t-tests  on  each  b-coef f icient  determined 
each  slope  to  be  non-zero  (i.  e.,  all  of  the  model  terms  are 
meaningful  predictors  of  RS  p  <  0.003).  The  effects  of  N 
and  AE  are  independent  in  Study  2  as  they  are  in  Study  1. 

The  Table  2  regression  models  are  of  the  form, 


RS  =  x. 


+  x.^N  +  x2AE 


+  x3AE 


e, 


(40) 
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where  xQ  is  the  RS  intercept,  x^  is  the  b-coef f icient  (a 

negative  value)  for  the  number  of  digits  read,  is  the  fa- 

coefficient  for  the  root  sum  of  squares  of  the 

independently,  linearly  rescaled  luminance  and  chrominance 

color  contrast  components  (AE),  x3  is  the  b-coef f icient  (a 

2 

negative  value)  for  AE  ,  and  e  is  random  error  variability. 

Figures  21  and  22  are  scattergrams  of  generalized, 

standardized  AE(Y,u',v')  by  RS  and  AE( L* , u * , v* )  by  RS  in 

Study  2,  respectively.  The  RS  means  for  positive  and 

negative  presentation  polarity  conditions  are  interspersed 

on  the  figures,  showing  no  spread  of  the  scatter  (nor 
2 

reduction  in  R  )  when  either  metric  is  applied  to  both 
polarities . 

Working  directly  from  these  metrics,  100%  legibility  is 
predicted  to  require  a  displayed  luminance  ratio  of  2.8,  or 
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Figure  22.  AE  in  luminance-generalized,  AE- standardized 
(L*,u*,v*)  space  by  RS  scattergram, 

Study  2  (Lippert,  1985). 
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equivalently,  a  displayed  chrominance-difference  of  0.6  (on 
the  2.2:1  rescaled  u' :v'  diagram).  The  100%  legibility 
predictions  are  for  contrast  conditions  greater  than  any 
tested  in  either  study  because  asymptotic  performance  was 
obtained  with  combinations  of  the  highest  luminance  ratios 
and  chrominance-differences  tested. 

The  generalized,  standardized  metric  expressions  are 
listed  in  Table  3,  which  includes  each  metric's  color 
component  weightings  as  described  in  the  INTRODUCTION.  It 
is  important  to  note  that  the  AE  metrics  describe  RS 
consistently  across  the  0.5  m  and  0.76  m  viewing  distances, 
although  they  could  not  be  expected  to  do  so  for  any  viewing 
distance,  character  size,  dot  width,  etc.  It  might  be  found 
through  further  experimentation  that  the  luminance  contrast 
to  chrominance  contrast  weighting,  R,  while  determined  to  be 
constant  for  Studies  1  and  2,  might  vary  with  the  spatial 
content  of  the  imagery  of  interest. 

A  somewhat  simpler  form  of  metric  will  now  be  considered. 
Researchers  (e.  g.,  Boynton,  1981,  and  others)  working  on 
the  visual  distinctness  of  bordering  multicolor  fields  and 
the  physiology  of  primate  vision  have  suggested  that  the 
difference  in  stimulation  rate  of  the  long-wavelength 
receptors  at  a  T/B  border,  AX,  might  better  describe  RS  with 
isoluminous  T/B  pairs  than  does  AE.  RS  for  the  averaged  M  = 
0.006  conditions  in  Study  1  were  subjected  to  regression 
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2  2 

analysis  with  AX,  resulting  in  r  =0.52,  compared  to  r  = 

0.74  for  a  2.2:1  (u1 :v' )  rescaled  metric.  For  Study  2 

2 

isoluminous  pairs,  r  =  0.54  and  0.75  for  the  AX  and 

rescaled  (u',v')  metrics,  respectively. 

There  are  slight  amounts  of  luminance  contrast  among  the 

most  nearly  isoluminous  pairs  tested  and  AX  does  incorporate 

some  luminance  contrast  information,  while  the  (u',v') 

metric  does  not.  AX  is  certainly  a  more  parsimonious  means 

of  RS  prediction,  and  it  works  reasonably  well  for  the 

isoluminous  case,  but  not  as  well  as  rescaled  (u',v').  For 

2 

rescaled  (Y,u',v'),  r  =  0.74  and  0.75,  respectively,  for 

2 

Studies  1  and  2,  differing  from  the  r  s  for  rescaled  (u',v') 

only  beyond  the  second  decimal  place.  When  used  to  describe 

2 

the  entire  Study  1  data  set,  r  =0.18  for  AX,  indicating 

the  limit  of  its  effectiveness  compared  to  rescaled 

2 

AE(Y,u',v'),  for  which  the  simple  linear  r  =  0.49.  For  the 

2  2 

complete  Study  2  data  set,  r  =0.20  versus  r  =0.54  for  AX 
and  rescaled  Y,u',v'  space,  respectively. 

» 

For  color  contrast  including  luminance  contrast,  a  two- 
component  formulation  might  be  considered  in  which  the  root 
sum  of  squares  of  AY  and  AX  are  represented.  For  RS  in 

Study  1,  a  2.2:1  Y:X  rescaling  is  required  for  the  maximum 

2  2 
obtainable  r  =  0.43  and,  for  Study  2,  r  =  0.45,  which 

compare  reasonably  well  with  the  linear  Y,u',v'  models  in 

the  preceding  paragraph. 


While  it  appears  there  is  some  loss  of  descriptive 

information  in  the  rescaled  (Y,X)  metric  relative  to  the 

rescaled  (Y,u' ,v' ),  (Y,X)  is  a  reasonably  good  predictor  of 

2 

RS .  A  rescaled  (Y,X)  metric,  comprised  of  AE/AE  /N  terms, 

2 

as  those  reported  in  Table  2,  results  in  R  =  0.84  as 

2 

compared  to  the  R  =  0.92  for  rescaled  (Y,u',v')  with 

2 

respect  to  the  entire  Study  1  data  set.  For  Study  2,  R  s  = 
0.85  and  0.95  for  the  three-term  rescaled  (Y,X)  and 
(Y,u',v')  metrics,  respectively. 

The  rescaled  (u',v')  combinations  with  Y,  logY,  or  L*  are 
each  superior  to  either  the  (X)  or  (Y,X)  metrics  in  the 
accuracy  of  their  description  of  RS  whether  luminance 
contrast  is  present  or  not.  The  single  rescaling  ratio  of 
2.2:1  for.  u'  : v '  and  Y:X  to  obtain  maximum  r  s  is  ruled  a 
coincidence  unless  further  study  shows  otherwise. 

In  summary,  the  results  of  the  two  studies  are  consistent 
and  conclusive.  Several,  simple,  first-order  AE  scales 
exist  which  serve  equally  well  to  describe  or  predict  RS 
with  multicolor  CRT  raster  imagery  for  a  range  of  character 
luminances,  over  two  viewing  distances,  and  in  both  positive 
and  negative  presentation  polarities.  These  are  the 
Y,u',v',  logY,u',v',  L*,u',v',  and  L*,u*,v*  rescaled  color 
spaces  (Table  3)  .  Because  of  its  predictive  accuracy  and 
simplicity,  a  luminance-generalized,  AE-standardized  Y,u',v' 
metric,  accounting  for  71%  and  75%  of  the  RS  variability  in 


Studies  1  and  2,  respectively,  is  recommended  as  the  most 
appropriate  metric  of  emissive  display  legibility  to  be 
tested  in  these  studies. 

Even  greater  predictive  power  can  be  obtained  with  a 
second-order  equation  using  the  rescaled  axes,  as 
illustrated  in  Table  2.  Again,  the  Y,u',v'  space  is 
slightly  superior  in  its  predictive  power  and  is  recommended 
due  to  its  non-convergent  geometry  and  its  inherent 
simplicity.  The  second-order  equations  predict,  on  the 
average,  about  20  percent  more  of  the  RS  than  do  the  first- 
order  equations.  Thus,  while  the  first-order  equations  are 
simpler  and  more  conservative,  the  second-order  equations 
are  recommended,  within  the  limits  of  the  present  research 
parameters,  for  more  precise  prediction. 


APPLICATIONS 
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The  lack  of  statistical  dependency  between  the  AE  and  N 
effects  suggests  that  for  many  information  extraction  tasks, 
the  AE  metrics  described  might  perform  operationally  in  an 
accurate  relative  manner.  That  is,  factors  other  than  color 
contrast  might  be  found  to  contribute  in  constant,  additive 
fashions  to  task  performance  speed,  as  did  N. 
Unfortunately,  logic  dictates  that  such  realistic  attributes 
as  image  dynamics  and  character  surround  spatial  complexity 
must,  by  definition,  alter  the  simple  specifications  of 
color  contrast  permitted  by  this  laboratory  study  and, 
therefore,  cannot  be  independent  of  AE .  To  date,  the  only 
known  experiment  to  test  the  legibility  of  numerals 
displayed  against  spatially  complex  multicolor  backgrounds 
with  a  shadowmask  CRT  was  conducted  by  Post  (1983).  AE 
scales  (from  the  convergent  and  non-convergent  spaces 
tested)  based  on  Post's  indirectly  determined  color 
parameters,  do  not  describe  Post' s  measure  of  RS  with  static 
imagery.  It  appears  that  dynamic  imagery  would  be  an  even 
less  likely  candidate  for  legibility  modeling  AE  metrics  of 
the  types  considered  in  this  report. 


However,  it  is  hoped  that  the  legibility  metrics  reported 
here  will  find  operational  confirmation  in  daylight  color 
CRT  environments,  airborne  head-down  displays,  etc.  In  such 
cases,  displayed  color  parameters  will  have  to  be  corrected 
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for  the  ambient  illumination,  which  may  vary  spatio- 
temporally  and  require  the  estimation  of  "worst  case" 
conditions . 

In  a  cockpit  environment,  for  example,  RS  might  be 
determined  for  a  relatively  small  sample  of  color  contrast 
conditions  spanning  at  least  the  luminance  contrast  range 
tested  in  the  present  research.  If  AE  versus  RS  regression 
analyses  prove  similar  to  those  reported  here,  it  would  be 
reasonable  to  assume  a  global,  or  uniform,  property  for  the 
metric  in  the  new  application. 

There  are  many  task  environments  such  as  CAD/CAM  stations 
and  office  VDTs  where  the  legibility  metrics  reported  here 
should  find  direct  application.  Because  numeral  reading 
accuracy  was  high  across  the  range  of  numeral  reading  speed, 
it  might  be  possible  to  employ  increasing  steps  in  AE  to 
color  displayed  information  one  wishes  to  separate  into 
increasing  levels  of  importance  or  salience. 

Although  the  present  findings  relate  specifically  to  the 
legibility  of  CRT  raster  imagery,  it  is  suggested  that  the 
applicability  of  the  AE  metrics  be  tested  for  discrete 
picture  element  electronic  imaging  technologies  such  as 
liquid  crystal  displays  (LCDs)  and  full-color 
electroluminescent  displays.  In  electronic  display  viewing 
environments  in  which  the  individual  picture  elements  are 
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not  readily  distinguishable  by  the  user,  that  is,  in  well 

designed  viewing  conditions,  it  might  prove  that  spatial 

content  differences  between  discrete  element  and  CRT  raster 

images  do  not  significantly  affect  legibility.  In  such  a 

case,  the  metrics  reported  here  should  apply  to  the  discrete 

2 

element  display  technology,  at  least  across  a  10-50  cd/m 
image  luminance  range. 

Where  AE  is  shown  to  act  independently  in  its  effect  on 
visual  performance,  experimentation  might  lead  to 
operational  metrics  which  could  be  employed  in  reverse. 
Time  spent  on  a  visual  information  extraction  task  might  be 
"metered"  through  the  specification  of  color  contrast  to 
obtain  the  desired  (i.  e.,  reduced)  level  of  performance, 

effectively  loading  operator  faculties  a  known  amount. 


CONCLUSIONS 


There  are  several  important  conclusions  to  be  drawn  from 
this  work.  In  visual  environments  where  electronically 
displayed  contrast  between  symbology  and  uniform  backgrounds 
is  limited  to  a  luminance  modulation  less  than  0.27  (i.  e., 
a  luminance  contrast  ratio  less  than  1.6),  legibility  can  be 
enhanced  by  employing  either  characters  or  backgrounds  of 
red  and  magenta  hues.  This  finding  was  obtained  in  Study  1, 
where  relatively  high  levels  of  RS  were  found  for  red 
numerals,  no  matter  the  background  hue,  even  without 
luminance  contrast.  Overall,  a  luminance  contrast  ratio  of 
1.6  might  be  considered  minimally  adequate  under  ideal 
viewing  conditions,  those  free  of  vibration,  glare, 
luminance  adaptation  effects,  etc. 

It  is  interesting  that  the  suprathreshold  performance 
investigated  in  the  present  work  is  well  modeled  by  a 
chrominance  space  altered  only  slightly  from  one  determi*.  ed 
to  be  the  best  available  for  threshold  modeling.  In  many 
areas  of  science,  a  disagreement  between  empirical  findings 
in  matching  theoretical  models  of  2:1  is  considered  trivial. 
However,  the  (u',v')  construct  is  an  empirical  development 
and  the  2.2:1  disparity  found  in  this  research  questions  the 
adequacy  of  existing  colorimetric  methodologies  and  argues 
for  continued  research  toward  a  more  coherent  color  science 
of  visual  perception  and  performance. 


The  CIE  1976  L*,u*,v*  metric  is  presently  finding  general 
application  as  a  predictor  of  many  types  of  color  visual 
behavior,  largely  because  it  has  been  the  available  standard 
since  its  inception  and  is  empirically  validated  to  be 
reasonably  uniform  with  respect  to  threshold  levels  of  color 
contrast  containing  little  or  no  luminance  contrast.  The 
present  research  has  determined  that  convergent  metrics  such 
as  L*,u*,v*  are  inappropriate  bases  for  the  prediction  of 
any  visual  performance  measure  which  is  monotonically 
related  to  luminance  contrast.  However,  it  has  also  been 


demonstrated 

that 

the 

L* ,  u 

*,v*  axes. 

if  linearly. 

independently 

rescaled. 

yield 

a  metric  of 

RS  predictive 

power  similar 

to 

that 

of 

the  non-convergent  metrics 

reported.  In 

fact. 

the 

great 

expansion  of 

L*  in  rescaled 

L*,u*,v*  indicates  that  the  statistical  rescaling  procedure 
increases  the  effect  of  luminance  contrast  in  the  rescaled 
AE  metric  and  serves  to  reduce  the  rescaled  space's  rate  of 
convergence  with  respect  either  to  luminance  or  chrominance 
units . 

Color  difference  in  non-convergent  3-spaces  always  varies 
monotonically  with  luminance  difference.  Appropriate  non- 
convergent  combinations  of  any  of  several  luminance  scalings 
with  rescaled  (u',v')  provides  AE  metrics  which  describe  RS 
well  for  both  positive  and  negative  presentation  polarities, 
simplifying  the  image  engineering  process. 
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The  luminance-generalized,  AE-standardized  expressions  of 
all  of  the  metrics  developed  through  this  work  yield  nearly 
identical  predictions  in  terms  of  percent  improvement  in  the 
legibility  of  CRT  raster  imagery  and  perhaps  will  be  shown 
to  apply  to  other  emissive  display  imagery.  The  AE 
formulations  in  Table  3  are  designed  specifically  for 
emissive  displays.  The  non-convergent  spaces  reported  here 
do  not  include  the  specification  of  illuminants,  etc.,  which 
are  meaningless  in  pure  self-luminous  image  characterization 
and  require  additional  computations  to  yield  measures  of  AE. 
Because  of  its  simplicity  and  RS  predictive  power,  Y,u',v' 
space  in  a  luminance-generalized,  AE-standardized  form  is 
the  most  highly  recommended  design  aid  for  legible 
multicolor  CRT  raster  imagery  to  come  from  these  studies. 

A  2.2:1  (u1 ,v' )  rescaling  results  in  chrominance 
differences  which  uniformly  describe  RS  with  all  of  the 
isoluminous  color  contrast  conditions  tested.  The  (u',v') 
rescaling  ratio  remains  2.2:1,  in  combination  with  any  of 
the  luminance  scalings  tested,  to  describe  RS  for  the  color 
contrast  conditions  including  luminance  contrast.  This  is 
one  argument  for  a  model  of  color  visual  recognition  in 
which  luminance  contrast  and  chrom. nance  contrast 
components,  as  defined  by  the  C IE,  are  independent  in  their 
effects.  Given  the  energies  previously  spent  investigating 
various  luminance  x  chrominance  interaction  effects  inherent 
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to  color-  and  brightness-matching  performance,  which  were 
briefly  discussed  in  the  INTRODUCTION,  the  apparent  lack  of 
similar  effects  on  RS  deserves  further  study. 

Perhaps  most  importantly,  the  assumption  that  the 

contribution  to  legibility  of  a  specific  combination  of 

luminance  contrast  and  chrominance-difference  units  is 

invariant  with  respect  to  the  absolute  level  of  retinal 

stimulation  is  supported  for  stimulus  combinations  ranging 

2 

in  luminance  from  10  to  50  cd/m  .  This  contrast  "constancy" 
effect  might  be  shown  to  generalize  to  higher  luminance 
imagery  through  continued  effort. 
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APPENDIX  A.  Stimulus  Color  Parameters: 
Studies  1  and  2,  (Lippert,  1984,  1985) 

Table  4.  Stimulus  Color  Parameters:  Study  1. 

---1931  CIE - 


Color  Name 

Code 

M  Level 

X 

y 

Y  (cd/m  ) 

Numerals  (T) 

Achromatic 

ACH 

all 

.310 

.316 

46.3 

Yellow-green 

Y-G 

all 

.370 

.472 

45.5 

Red 

RED 

all 

.605 

.344 

48.1 

Backgrounds  (B) 

Purple 

P 

.006 

.383 

.222 

47.9 

.060 

.378 

.220 

42.8 

.  120 

.380 

.220 

38.6 

.  164 

.380 

.219 

34.9 

.210 

.372 

.220 

31.0 

.270 

.373 

.219 

27.5 

.316 

.374 

.218 

25.0 

Violet 

V 

.006 

.253 

.202 

46.0 

.060 

.255 

.  191 

42.6 

.  120 

.254 

.  190 

38.4 

.  164 

.255 

.  190 

34.5 
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Table  4,  continued 


Table  5.  Stimulus  Color  Parameters:  Study  2. 
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